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Abstract 

There are two natural metrics defined on an arbitrary convex cone: Thompson's part 
metric and Hilbert's projective metric. For both, we establish an inequality giving infor- 
mation about how far the metric is from being non-positively curved. 

1 Introduction 

Let C be a cone in a vector space V. Then C induces a partial ordering on V given by x < y 
if and only if y — x S C. For each x 6 C\{0}, y G V, define M(y/x) := inf{A 6l:i/< \x}. 
Thompson's part metric on C is defined to be 

dr(x,y) := logmax \M{x/y), M{y/x) 

and Hilbert's projective metric on C is defined to be 

d H (x,y) := log (M(x/y)M(y/x) 



Two points in C are said to be in the same part if the distance between them is finite in the 
Thompson metric. If C is almost Archimedean, then, with respect to this metric, each part of 
C is a complete metric space. Hilbert's projective metric, however, is only a pseudo-metric: it 
is possible to find two distinct points which are zero distance apart. Indeed it is not difficult to 
see that du{x, y) = if and only if x = Xy for some A > 0. Thus dn is a metric on the space of 
rays of the cone. For further details, see Chapter 1 of the monograph |23|. 

Suppose C is finite dimensional and let S be a cross section of C, that is S := {x 6 C : 
l{x) — 1}, where I : V — > M is some positive linear functional with respect to the ordering on 
V. Suppose x,y £ S are distinct. Let a and b be the points in the boundary of S such that a, 
x, y, and b are collinear and are arranged in this order along the line in which they lie. It can 
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be shown that the Hilbert distance between x and y is then given by the logarithm of the cross 
ratio of these four points: 

j / \ i IM \ay\ 
d H {x 7 y) = logfr-p — 
\by\ \ax\ 

Indeed, this was the original definition of Hilbert. If S is the open unit disk, the Hilbert metric 
is exactly the Klein model of the hyperbolic plane. 

An interesting feature of the two metrics above is that they show many signs of being 
non-positively curved. For example, when endowed with the Hilbert metric, the Lorentz cone 
{(t,xi, . . . ,x n ) e R n+1 : t 2 > x\ + ■ ■■ + x^} is isometric to n-dimensional hyperbolic space. 
At the other extreme, the positive cone W]_ := {(x±, . . . , x n ) : Xi > for 1 < i < n} with either 
the Thompson or the Hilbert metric is isometric to a normed space which one may think 
of as being flat. In between, for Hilbert geometries having a strictly-convex C 2 boundary with 
non-vanishing Hessian, the methods of Finsler geometry |28j apply. It is known that such 
geometries have constant flag curvature —1. More general Hilbert geometries were investigated 
in |17j where a definition was given of a point of positive curvature. It was shown that no 
Hilbert geometries have such points. 

However, there are some notions of non-positive curvature which do not apply. For example, 
a Hilbert geometry will only be a CAT(O) space (sec 6 ) if the cone is Lorentzian. Another 
notion related to negative curvature is that of Gromov hyperbolicity . In , a condition is 
given characterising those Hilbert geometries that are Gromov hyperbolic. This notion has also 
been investigated in the wider context of uniform Finsler Hadamard manifolds, which includes 
certain Hilbert geometries |12| . 

Busemann has defined non-positive curvature for chord spaces 7 . These are metric spaces 
in which there is a distinguished set of geodesies, satisfying certain axioms. In such a space, 
denote by m xy the midpoint along the distinguished geodesic connecting the pair of points x 
and y. Then the chord space is non-positively curved if, for all points u, x, and y in the space, 

d(m ux ,m uy ) < ^d(x,y), (1) 

where d is the metric. 

In the case of the Hilbert and Thompson geometries on a part of a closed cone C, there 
will not necessarily be a unique minimal geodesic connecting each pair of points. However, it is 
known that, setting j3 :— M (y/x; C) and a := l/M(x/y; C), the curve <fi : [0, 1] — ► C : 




I3a s - a(3 s 



x, if /3 ^ a, 



[s;x,y) :={ \ p-a J" ' \ P-a J ' ' ' ' (2) 
a s x, if P = a 

is always a minimal geodesic from x to y with respect to both the Thompson and Hilbert 
metrics. We view these as distinguished geodesies. If the cone C is finite dimensional, then each 
part of C will be a chord space under both the Thompson and Hilbert metrics. Notice that the 
geodesies above are projective straight lines. If the cone is strictly convex, these are the only 
geodesies that are minimal with respect to the Hilbert metric. For Thompson's metric, if two 
points arc in the same part of C and are linearly independent, then there are infinitely many 
minimal geodesies between them. 

In this paper we investigate whether inequalities similar to Q hold for the Hilbert and 
Thompson geometries with the geodesies given in (J5J). We prove the following two theorems. 
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Theorem 1 Let C be an almost Archimedean cone. Suppose u,x,y E C are in the same part. 
Also suppose that < s < 1 and R > 0, and that dn(u, x) < R and duiu, y) < R. If the linear 
span of {u, x,y} is 1- or 2- dimensional, then dx [<t>(s; u, x) , <f>{s] u, y)) < sdT(x,y). In general 



d T ((f)(s;u,x),<f>(s;u,y)) < 



2(1 



1-e- 



d T (x,y). 



(3) 



Note that the bracketed value on the right hand side of this inequality is strictly increasing in 
R. As R — > 0, this value goes to s, which reflects the fact that in small neighborhoods the 
Thompson metric looks like a norm. As R — > oo, the bracketed value goes to 2 — s. 

Theorem 2 Let C be an almost Archimedean cone. Suppose u,x,y E C are in the same part. 
Also suppose that < s < 1 and R > and that dn{u,x) < R and dn(u,y) < R. If the linear 
span of {u,x,y} is 1- or 2- dimensional, then dji (4>(s; u, x) , 4>(s-, u, y)) < sdH{x,y). In general 



d H ((/>(s;u,x),(f)(s;u,y)) < 



-Ra 



1 



d H (x,y). 



(4) 



This 



Again, the bracketed value on the right hand side increases strictly with increasing R. 
time, it ranges between s as R — > and 1 as R — > oo. 

Our method of proof will be to first establish the results when C is the positive cone M^, 
with N > 3. It will be obvious from the proofs that the bounds given are the best possible in 
this case. A crucial lemma will state that any finite set of n elements of a Thomson or Hilbert 
geometry can be isometrically embedded in R™'™ ^ with, respectively, its Thompson or Hilbert 
metric. This lemma will allow us to extend the same bounds to more general cones, although 
in the general case the bounds may no longer be tight. 

A special case of Theorem |21 was proved in |29| using a simple geometrical argument. It 
was shown that if two particles start at the same point and travel along distinct straight-line 
geodesies at unit speed in the Hilbert metric, then the Hilbert distance between them is strictly 
increasing. This is equivalent to the special case of Theorem when dn(u,x) = du(u,y) and 
R approaches infinity. 

A consequence of Theorems ^ an d El is that both the Thompson and Hilbert geometries are 
semihyperbolic in the sense of Alonso and Bridson pQ . Recall that a metric space is semihyper- 
bolic if it admits a bounded quasi-geodesic bicombing. A bicombing is a choice of path between 
each pair of points. We may use the one given by 



C(x,2/)(*) 



y, 



otherwise 



for each pair of points x and y in the same part of C . Here d is either the Thompson or Hilbert 
metric. This bicombing is geodesic and hence quasi-geodesic. To say it is bounded means that 
there exist constants M and e such that 



d (C(x,v)(t))C(u>,*)(*)) < Mmax(d(x,w),d(y,z)) +e 
for each x,y,w,z<EC and t E [0, oo). 

Corollary 1 Each part of C is semihyperbolic when endowed with either Thompson's part met- 
ric or Hilbert's projective metric. 
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It should be pointed out that for some cones there are other good choices of distinguished 
geodesies. For example, for the cone of positive definite symmetric matrices Sym(n), a natural 
choice would be tf>(a;X,Y) := X 1 / 2 (X- 1 / 2 YX- 1 / 2 ) S X 1 / 2 for X, Y G Sym(n) and s G [0,1]. 
It can be shown that, with this choice, Sym(n) is non-positively curved in the sense of Busc- 
mann under both the Thompson and Hubert metrics. This result has been generalized to both 
symmetric cones an d to the cone of positive elements of a C*-algebra |1U|. 

Although Hilbert's projective metric arose in geometry, it has also been of great interest 
to analysts. This is because many naturally occurring maps in analysis, both linear and non- 
linear, are either non-expansive or contractive with respect to it. Perhaps the first example 
of this is due to G. Birkhoff 0E], who noted that matrices with strictly positive entries (or 
indeed integral operators with strictly positive kernels) are strict contractions with respect to 
Hilbert's metric. References to the literature connecting this metric to positive linear operators 
can be found in ^J^||- It has also been used to study the spectral radii of elements of Coxeter 
groups (201 • Both metrics have been applied to questions concerning the convergence of iterates 
of non- linear operators [HI El 1231 1241 • The two metrics have been used to solve problems 
involving non-linear integral equations [23 ISO) , linear operator equations 0121, and ordinary 
differential equations |51 l25llSTll32j . Thompson's metric has also been usefully applied in |24II26| 
to obtain "DAD theorems" , which are scaling results concerning kernels of integral operators. 
Another application of this metric is in Optimal Filtering ^j, while Hilbert's metric has been 
used in Ergodic Theory ^BJ and Fractal Diffusions |21| . 

2 Proofs 

A cone is a subset of a (real) vector space that is convex, closed under multiplication by positive 
scalars, and does not contain any vector subspaces of dimension one. We say that a cone is 
almost Archimedean if the closure of its restriction to any two-dimensional subspace is also a 
cone. 

The proofs of Theorems Q and [21 will involve the use of some infinitesimal arguments. We 
recall that both the Thompson and Hilbert geometries are Finsler spaces [221- If C is a closed 
cone in 1$L N with non-empty interior, then int C can be considered to be an iV-dimensional 
manifold and its tangent space at each point can be identified with R N . If a norm 

\v\x '■— infja > : —ax < v < ax} 

is defined on the tangent space at each point x G int C, then the length of any piecewise C 1 
curve a : [a, b] — > int C can be defined to be 

L T (a) := f \a'{t)\ T a{t) dt. 

J a 

The Thompson distance between any two points is recovered by minimizing over all paths 
connecting the points: 

d T (x,y) =M{L T (a) : a G PC^x^y]}, 

where PC 1 [a;,?/] denotes the set of all piecewise C 1 paths a : [0, 1] — > intC with a(0) = x and 
a(l) = y. A similar procedure yields the Hilbert metric when the norm above is replaced by 
the semi-norm 

\v\% := M(v/x) - m{v/x). 

Here M(v/x) is as before and m(v/x) := sup{A G M : v > Ax}. The Hilbert geometry will be 
Riemannian only in the case of the Lorentz cone. The Thompson geometry will be Riemannian 
only in the trivial case of the one-dimensional cone R+. 
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Our strategy will be to first prove the theorems for the case of the positive cone and 
then extend them to the general case. The proof in the case of R+ will involve investigation of 
the map g : intM^ -> intK?: 




g(x) := 4>(s; t,x) = < \ b — a J \ b — a ) (5) 



where b := b(x) := max^ Xi and a := a(x) := min^ Xi. Here s G (0, 1) is fixed and we are using 
the notation 1 := (1,...,1). The derivative of g at x G intR+ is a linear map from — > R . 
Taking | • |^ as norm on the domain and | • as norm on the range, the norm of g'(x) is 

|| 9 '0r)|| T := S up{\g'(x)(v)\^ x) :\v^<l}. 

If, instead, we take the appropriate infinitesimal Hilbert semi-norms on the domain and range, 
then the norm of g'(x) is given by 

\\g'(x)\\ H := S up{\g>(x)(v)\» x) :\v£ < 1}. 

For each pair of distinct integers / and J contained in {1, ... , N}, let 

Uu := |x G intR^ : < xj < X, < xj for all i G {1, . . -,N}\{I, J}\. 

On each set Uij, the map g is C 1 and is given by the formula 

XjXj XjXj 



0O) 



Xj — Xi 



Let U denote the union of the sets I, J £ {1, ... , N}, I ^ J. If x G Mrf\U, then there must 
exist distinct integers m,n £ {1, . . . , TV} with either 

The set x G R^ with x n = x m has (iV-dimensional) Lebesgue measure zero, so the complement 
off/ in R^ has Lebesgue measure zero. 

We recall the following results from [22]. The first is a combination of Corollaries 1.3 and 
1.5 from that paper. 

Proposition 1 Let C be a closed cone with non-empty interior in a finite dimensional normed 
space V. Suppose G is an open subset of intC such that (j)(s;x,y) G G for all x,y G G and 
s G [0, 1]. Suppose also that f : G — > intC is a locally Lipschitzian map with respect to the norm 
on V . Then 

inf{fc > : d T (f(x),f(y)) < kd T (x,y) for all x,y G G} = ess sup xeG \\f {x)\\ T . 

It is useful in this context to recall that every locally Lipschitzian map is Frcchct differentiablc 
Lebesgue almost everywhere. The next proposition is a special case of Theorem 2.5 in |22| . 

Proposition 2 Let C be a closed cone with non-empty interior in a normed space V of finite 
dimension N. Let I be a linear functional on V such that l(x) > for all x G intC, and 
define S :— {x G C : l(x) = 1}. Let G be a relatively-open convex subset of S. Suppose that 
f : G — * intC is a locally Lipschitzian map with respect to the norm on V . Then 

inf{fc > : d H (f(x),f(y)) < kd H (x,y) for all x,y G G} = ess sup xeG \\f'(x)\\f { , 

where \\f'(x)\\jf := su p{\f'( x )(v)\^ x -j ■ \v\ x < 1, l(v) =0}. Here the essential supremum is 
taken with respect to the N — 1-dimensional Lebesgue measure on S . 
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Since we wish to apply Propositions ^ and [2 to the map g, we must prove that it is locally 
Lipschitzian. 

Lemma 1 The map g : int(Mr?) — > mi(R^) defined by |3|) is locally Lipschitzian. 

Proof. We use the supremum norm ||x||oo := max^ \xi\ onl*. Clearly, \b(x) — b(y)\ < ||x — y||oo 
and \a(x) — a(y)\ < \\x — y\\oc for all x, y G int Therefore both a and b are Lipschitzian 

with Lipschitz constant 1. 

Let 7 : [0, oo) — > [0, oo) be defined by 

7 «):=(f^P fo "^- 
I s, for t = 1. 

Then g may be expressed as 

g (x) = a 3 - 1 j (b/a) x + a s (l-j (b/a) ) 1. 
The Binomial Theorem gives that 

(£)(*-!)* for |t- 1| < 1 

and so 7 is C°° on a neighborhood of 1. Hence it is C°° on [0, 00), and thus locally Lipschitzian. 
It follows that g is also locally Lipschitzian. □ 



2.1 Thompson's metric 

We have the following bound on the norm of g'(x) with respect to the Thompson metric. 

Lemma 2 Consider the Thompson metric on intM.^. Let x G U\ : n- If N — 1 or N = 2 then 
the norm of g' at x is given by ||<?'(;e)||t = s. If N > 3, then 

\\g'(x)\\ T - XN - XN ~ 1 Q ( XN \ X l +1 + ~ x l) x N-l + XN-1 ~ X ^ q f /gx 

X N - Xl \ Xi ) E N _x E N -l X N -X! \X N J £jV-l 

where 0(t) := (1 — s) — t s + st and Ei(x) :— Ei := Xi(x s N — x\) + xnx\ — x\X s N . 

Proof. If N = 1 and x > 0, then g(x) = x s . We leave the proof in this case to the reader and 
assume that N > 2. 
For x G Ui.n, 

( Xj\jX^ X\X ly 



g{x) 

Let 



( X N 






\XN 





\ xn — x\ 



g. t (x) dxj 

Straightforward calculation gives, for each j G {1, . . . , N}, 



hij(x) — sSij 
and h,Nj{%) = sSjyj. 
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Here £y is the Kronecker delta function which takes the value 1 if i = j and the value if i 7^ j. 
Clearly, hij(x) = for 1 < i < N and j £ {1, i, N}. For 1 < i < N, 



ha{x) 
ha(x) 
h iN (x) 



%N ~ Xi 



xn \ x\ +l 



Xl 



Ei 



x 



N 



X 1 



Ei ~ Xl 

Xj - Xi 

xn — xi 



Xl 

XN 



r 8 + l 



>0, 
>0, 
< 0. 



(7) 
(8) 
(9) 



Inequalities ([7HHJ) rely on the fact that 9(t) > for t > 0. This may be established by observing 
that (9(1) = 6' (1) = and 6"{t) > for t > 0. 
Let 



B T := {v e 1* : max., |^| < 1} 



We wish to calculate 



\g' (x)\ | T = sup j j hijVj 



1 < i < N, v e B 1 



(10) 



For i — 1 or i — N, we have | J^j hyfjl < s for any choice of v G B T . If AT = 2, then it 
follows that ||p'(a;)||r = s for all x 6 Ux,n- 

For the rest of the proof we shall therefore assume that N > 3. For 1 < z < N, it is clear 



from Inequalities (|7I9|I that i s maximized when v\ 



1 and vn 



-1. In this 



case 



1 



C\X l + c 2 
C3^i + C 4 ' 



I X-y + (Sjy — Xi)X{ 

Xl 



Xi Xi 
i 

X N — Xl 



Xl 
XN 



' N 



(11) 

(12) 



where ci, C2, C3, and C4 depend on x\ and xn but not on x^. Observe that c^Xi + C4 / 
for x\ < Xi < xn- Given this fact, the general form of Expression (|12J1 leads us to conclude 
that it is either non-increasing or non-decreasing when regarded as a function of a;,-. When we 



substitute Xi — x±, we get | ^ h 



j • "v v j 



5> 



s. When we substitute Xi = xn, we get 

2(i-(x 1 /x N y 

1 - (xi/x N ) 



(13) 



Now, writingr(i) := 2(l-i s )/(l-t)-s, wehaver'(t) = -2Ve{fT l )/(l-t) 2 < 0, in other words 
r is decreasing on (0, 1). In particular, T(xi/xn) > hm t _»i T(t) = s. Therefore Expression (|llfl 
is non-decreasing in X4. So, the supremum in (|10f> is attained when v is as above and i — N — 1. 
Recall that xn-i is the second largest component of x. The conclusion follows. □ 



Corollary 2 Let R > 0. If N = 1 or N = 2, then ess sup{||ff'(x)|| T : a; 6 mtR^} = s. If 
N > 3, i/ien 

2(1 _ e ~ Rs ) 

ess sup{|| 5 '(x)|| T : d H (a:,l) < R} = -± - s. 
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Proof. Note that if a : M.+ — » is some permutation of the components, then g o a(x) — 
a o g(x) for all x E Mr?. Furthermore, a will be an isometry of both the Thompson and Hubert 
metrics. It follows that, given any x G Ui ; j with I, J E {1, . . . , N}, I ^ J, we may reorder the 
components of x to hnd a point y in J7i,jv such that ||ff'(y)||T = Is'^IIt- Recall, also, that the 
complement of U in intR+ has TV-dimensional Lebesgue measure zero. From these two facts, 
it follows that the essential supremum of |</(x)||t over Br(1) := {x E intM^ : oIh(x, 1) < R} 
is the same as its supremum over 5^(1) n U±.n- 

In the case when N = 1 or N = 2, the conclusion follows immediately. 

For N = 3, we must maximize Expression (JBj under the constraints x\ < xjq-i < xn and 
xi/xn > exp(—R). First, we maximize over xn-i, keeping x\ and xn fixed. In the proof of 
the previous lemma, we showed that Expression (|HJ| is non-decreasing in xn-i, and so it will be 
maximized when xjv— l approaches xn- Here it will attain the value 



2(^1 - (Xi/XN)^ 

1 - (xi/x N ) 



T( Xl /x N ). (14) 



We also showed that T is decreasing on (0, 1). Therefore l|14|) will be maximized when x\/xn = 
exp(— R), where it takes the value 

2{l-e~ Rs ) _ 
1 - e-« ~ S ' 

□ 



Lemma 3 Let C be an almost Archimedean cone and let {xi : i & 1} be a finite collection 
of elements of C of cardinality n, all lying in the same part. Denote by W the linear span 
of {xi : i G 1} and write Cw ■= C D W. Denote by intC'w the interior of Cw as a subset 
of W, using on W the unique Hausdorff linear topology. Then each of the points Xf,i G I 
is contained in intCw- Furthermore, there exists a linear map F : W -> R™("- 1 ) such that 
F(intC w ) C and 

M( Xi / Xj ;C) = M(F(x i )/F(x i y,WL'£ n - 1) ) (15) 

for each i,j G /. 

Proof. Since the points {xt : i E 1} all lie in the same part of C, they also all lie in the 
same part of C'w- Therefore there exist positive constants a%j such that Xj — a^xi G Cw for 
all 2, j G /. If we define a := minja.y : i,j G /} it follows that Xj + bxi G Cw whenever \S\ < a 
and i,j G /. Now select i\, . . . ,i m El such that {xi k : 1 < k < m} form a linear basis for W. 
For each y E W , we define ||y|| := maxd^ : 1 < k < m}, where y = X^fcLi bkXi k is the unique 
representation of y in terms of this basis. The topology on W generated by this norm is the 
same as the one we have been using. If ||y|| < a/m and j G I, then Xj + mbkXi k E Cw for 
1 < k < m. It follows that 

Xj + y = — y^X x i + m ^kXi k ) E Cw 

771 k=l 

whenever \\y\ \ < a/m. This proves that Xj E int Cw for all j E I. 

It is easy to see that /3y := M{xi/xj]C) — M{xi/xj\Cw) f° r all i,j E I, i ^ j. Observe 
that ftijXj — Xi E dC'w- Since intCv^ is a non-empty open convex set which does not contain 
flijXj — Xi, the geometric version of the Hahn-Banach Theorem implies that there exists a linear 



8 



functional : W — > M and a real number r.y such that fijifiijXj — %i) < r^- < fij(z) for all 
z G int CW- Because is in the closure of int CV and /jj(0) = 0, we have < 0. On the other 
hand, if fij(z) < for some z G intCy^, then considering fij(tz) we see that would not be 
bounded below on intCV- It follows that ry = 0. Since /3yXj — Xi is in the closure of intCV, 
we must have fij(PijXj — Xi) = 0. 
Now, define 

F:W^ R n ( n_1 ) . 2 _ (/ y (z)) <ii6J>w , 

so that /y- (2) ; « , j G /, 2 7^ j are the components of F(z) . Clearly, F is linear and maps int Cw 
into intR™*-" . Also, for all i, j G /, i 7^ j, 

M(F(a; i )/F(x i );K+ ( "" 1) ) = mf{A > : /jw(Axj - x,) > for all k,l € I, k ^ I}. 

For A > /Jy, we have Xxj — Xi G cl Cw and so fui^Xj — Xi) > for all k,l £ I, k ^ I. On 
the other hand, for A < /3y, we have fij(Xxj — Xi) < since fij(xj) > 0. We conclude that 

M(F(a; i )/ J F(^);M+ ( " _1) ) = fa. □ 



Lemma 4 Theorem^ holds in the special case when C = ruitfi TV > 3. 

Proof. Each part of consists of elements of all having the same components equal to 
zero. Thus each part can be naturally identified with intR™ , where n is the number of strictly 
positive components of its elements. We may therefore assume initially that {x, y, u} c intR+ . 
Define L : R w — » M. N by L(z) :— (u\Z\, . . . , unzn)- Its inverse is given by L^ 1 (z) := 
Z\, . . . , u^f Zn). Both L and L~ x are linear maps which leave invariant. It follows 
that L and L^ 1 are isometries of R^ with respect to both the Thompson and Hilbcrt metrics. 
Therefore, for u, z G intR?, 

L- 1 {4>{s;u,z)) = (j>{s;L- l {u),L- 1 {z)). 

Thus, we may as well assume that u = 1. 

We now wish to apply Proposition^with / := g and G := Bn +e (t) = {z£ R^ : oIh{z, 1) < 
R + e}. It was shown in [22] that G is a convex cone, in other words that it is closed under 
multiplication by positive scalars and under addition of its elements. Since <p(s; w,z) is a positive 
combination of w and z, it follows that <p(s; w, z) is in G if w and z are. If we now apply Lemma^ 
Proposition^] and Corollary|21 and let e approach zero, we obtain the desired result. □ 



Lemma 5 Theorem holds in the special case when the linear span of {x,y,u} is one- or 
two- dimensional. 

Proof. Let W denote the linear span of {x, y, u}, in other words the smallest linear subspace 
containing these points. By Lemma x, y, and u are in the interior of C fl W in W . It is easy 
to see that M(z/w; C) — M(z/w; C fl W) for all w,z G int (C fl W). Therefore, we can work in 
the cone C C\W . 

It is not difficult to show that if m := dimW 7 is either one or two, then there is a 
linear isomorphism F from W to R m taking int (C fl W) to intR™. It follows that F is an 
isometry of both the Thompson and Hilbert metrics and F(<f>(s; z,w)) — cj>(s; F(z), F(w)) for 
all z,w G int (C fl W). We may thus assume that C = R™ and u,x,y G int C . 

As in the proof of Lemma 0] we may assume that u = 1. 

To obtain the required result, we apply Lemma^ Corollary|21 and Propositionnwith / := g 
and G := intMT. □ 
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Proof, [of Theorem ^ Let W denote the linear span of {x,y,u}. Lemma [3] handles the 
case when these three points are not linearly independent; we will therefore assume that they 
are. Thus the five points x, y, u, (j)(s;u,x), and <fi(s;u,y) are distinct. We apply Lemma[j|]and 
obtain a linear map F : W — > with the specified properties. From (|15J) . it is clear that 
dx{z, w) — cIt'(F(z), F(w)) for each z,w £ {x, y, u, <fi(s; u, x), <f>(s; u, y)}. Here we are using dr' 
to denote the Thompson metric on Note that cf>(s; u, x) is a positive combination of u and x 
and that the coefficients of u and x depend only on s, M(u/x; C), and M(x/u; C). The latter two 
quantities are equal to AI(F(u)/ F(x); R 2 ^) and M(F(x)/F(u)- 1 R 2 ^) respectively. We conclude 
that F((j)(s; u, x)) = (f>(s; F(u), F(x)). A similar argument gives F((j)(s; u, y)) = 0(s; F(u), F(y)). 
Inequality JSJ) follows by applying Lemma 01 to the points F(x), F(y), and F(u) in the cone 

R20. ' ' ' □ 



2.2 Hilbert's metric 

We shall continue to use the same notation. Thus, for a given N S N and s s (0, 1), we use g to 
denote the function in © and U to denote the union of sets Uij with I, J G {1, ... , N}, I ^ J. 
We also use the functions 9{t) :— (1 — s) — t s + st and Ei(x) := Ej := Xi{x s N — x\ ) + xnx\ — x\X s N , 
and write hij(x) := (xj / gi(x))dgi/ dxj(x). As was noted earlier, 0(t) > if t > and t ^ 1. 
Also, j(t) :— (1 — t s )/(l — t), 7(1) :— s is strictly decreasing on [0, 00). We shall also use the 
simple but useful observation that if c\, C2, C3, and C4 are constants such that c^t + C4 ^= 
for a < t < b, then the function t 1— > [c\t + C2)j(cj,t + C4) is either increasing on [0,6] (if 
C1C4 — C2C3 > 0) or decreasing on [a, b] (if c\c^ — C2C3 < 0). Either way, the function attains is 
maximum over [a, b] at a or b. 

Recall that if g is Frechet differentiable at x £ intK^ then ||3'(a;)||jj denotes the norm of 
g'{x) as a linear map from (1*, || • ||^) to (1*, || • ||^ x )), although, of course, || ■ \\^ and || ■ Wf^ 
are semi-norms rather than norms. 

Lemma 6 Consider the Hilbert metric on intM.^ with N > 2. Let x £ U\^m- If N = 2 then 
the norm of g' at x is given by \ \g'{x)\\H = s. If N > 3, then 

, = x_ E -x Ilzle + (*' N -xi)xN-i (16) 

xn - xi \ xi J E N -i E N ^i 

Proof. The norm of g'(x) as a map from (1*, || ■ ||^) to (M. N , || • is given by 



||g'(a;)||K= sup maxV^-Zij-jjuj, 



where 

B n := 



ff := yv € R w : maxj — mim,- < lj. 



To calculate | |g'(a;) |ir we will need to determine the sign of hij — hkj for each i, j, k G {1, . . . , TV}. 
We introduce the notation 

L ik := sup 22(hij ■- h kj )vj. (17) 

Note that g is homogeneous of degree s, in other words g(Xx) — X s g(x) for all x £ and 
A > 0. Therefore, 
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for each i G {1, . . . , N}. Thus J^j h>ij = s f° r ca ch i £ {1, . . . , N}, a fact that could also have 
been obtained by straightforward calculation. It follows that 

^2(hij - h kj )v 3 = ^(hij - h kj ){vj + c) (18) 
j j 

for any constant cel. 

It is clear that an optimal choice of v in 117|) would be to take Vj := 1 for each component j 
such that hij — hkj > and Vj := for each component such that h^ — h k j < 0. Alternatively, 
we may choose Vj := when hij — h k j > and Vj := —1 when hij — hkj < 0. That the optimal 
value is the same in both cases follows from <|18ll . Also, it is easy to see that Lik = Lki- 

Fix i, k G {1, . . . , N} so that i < k. There are four cases to consider. 

• Case 1. 1 < i < fc < iV. Recall that h\j(x) — s5\j and hMj{x) = sSjyj- A calculation 
using Equations gives 

Ei{x)E k (x)(ha(x) - hki(x)) = x s N x\ +1 {x k - ^)^(^) > 

and 

Ei(x)E k (x)(h iN (x) - h kN {x)) = xlx^ixk - Xi)6[—) > 0. (19) 

v xn ' 

We also have that hu(x) — hki(x) = hu(x) > and hik(x) — h k k(x) — ~hkk{x) < 0. So 
an optimal choice of v G B H in Equation l|17fl is given by Vj :— ~5jk- We conclude that 
Lik = hkk in this case. 

• Case 2. 1 = i < k < N. We will show that hki(x) < hn(x) — s. Consider X\ 
and xn as fixed and Xk as varying in the range x\ < Xk < xjy. From Equation (|T|). 
hki{x) = {c\Xk + C2)/ \c3Xk + C4), where c%, C2, C3, and C4 depend on x\ and xjv, and both 
C3 and C4 are positive. A simple calculation shows that C1C4 — C2C3 = —9{xn/xi)x\ +1 x s n , 
which is negative. Hence hk\ is decreasing in Xk and takes its maximum value when 
Xk — x\. Here it achieves the value 

xi e ( x N\ = g _ x{~ s (x s N -xj) < s 
xn — xi V xi ) xn — x\ 

Thus we conclude that h\\{x) — hk\{x) > 0. We also have that h\k{x) — hkk{x) = 
—hkk(x) < and /iiat(x) — hkN(x) — ~hkN(x) > 0. Thus the optimal choice of v £ B H 
is given by Vj := —5j k - We conclude that in this case Lik(x) = hkk{x). 

• Case 3. 1 < i < k — N. Here hn > h^i = 0, ha > h^i — 0, and hiN < h^N = s. So 
the optimal v £ B is given by Vj := Sji + Sji. We conclude that Lay — hn + ha. 

• Case 4. i = 1 and k = N. Here s = h\\ > h^i — and = hi^ < h^M = s. Thus the 
optimal v £ B H is given by Vj := Sij. We conclude that L\m = s. 

If N = 2 then Case 4 is the only one possible, and so ||<7'(a;)||ij = s. So, for the rest of the 
proof, we will assume that N > 3. 

We know that hn(x) + hu(x) = s — hny(x) > s so Case 3 dominates Case 4, that is to say 
LiN{x) > Lim{x) for i > 1. Since hn(x) > for i £ {1, . . . , N}, Case 3 also dominates Cases 1 
and 2, meaning that Li^{x) > Li k {x) for k < N , i < k. 

The final step is to maximize Lnq[x) = hn(x) + hu(x) = s — hi^(x) over i G {2, . . . , N — 1}. 
From (|19fl . h m iq(x) > h n N(x) for m < n. Thus the maximum occurs when i = iV— 1. Recall that 
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we have ordered the components of x in such a way that Xn-i is the second largest component 
of x. We conclude that 

= m ax L ik = hjy-1,1 + fr,jv-i,AT-i 

i,k:i<k 

By substituting the expressions in 10 and (jHJ), we obtain the required formula. □ 

Corollary 3 Let R > and N > 2. Let I be a linear functional on M. N such that l(x) > for all 
x G intRf and define S:={i€lf: l(x) = 1}. If N = 2, then ess sup{\\g' (x)\\ H : x € S} = s. 
IfN>3, then 

1 - e~ Rs 

ess sup{||g> (x)\\ H ■ d H {x, 1) < R, x e S} = 5-. 

1 — e n 

In both cases, the essential supremum is taken with respect to the N — 1- dimensional Lebesgue 
measure on S. 

Proof. Note that the complement of U (~l S in S has N — 1-dimensional Lebesgue measure 
zero. Using the reordering argument in the proof of Corollary we deduce the result in the 
case when N — 2. 

The case when ./V > 3 reduces to maximizing the right hand side of i|lt)|) subject to the 
constraints x\ < ajjv-i < xn and xi/xn > exp(— R). We can write the expression in l|16|) in 
the form s + (c\Xn-i + C2)/(c3-ttv-i + C4), where c\, C2, C3, and C4 depend only on x\ and Xn 
and ci > 0, C2 < 0, C3 > 0, C4 > 0. It follows that, if we view x\ and xn as fixed and xn-i as 
variable, the expression is maximized when x^-i = x^. The value obtained there will be 

1 - (x 1 /x N y 

- — = 7(xi/xjvJ- 

1 - (xi/x N ) 

If we recall that 7 is decreasing on [0, 1) and x\/xn > exp(— R), we see that 



\9'(x)\\h< 



1 - e~ Rs 
l-e~ R ' 



If xi/xn = cxp(— R), then, by choosing x £ J/^jv with xn-i close to xn, we can arrange that 
I Iff' (a;) Hi/ is as close as desired to this value. □ 



Lemma 7 Theorem^ holds in the special case when C = R+ with N >'S. 

Proof. As in the proof of Lemma we may assume that x,y £ intR^X and u = 1. Define 
I : R N -> R by l(z) := z t /N and let S ;= {x G K+ : = 1}. Then J is a linear functional 

and l(z) > for all z G intR+. It is easy to check that (f)(s; Az, fiw) — X 1 ~ s fi s ^)(s;z,w) for all 
A, fi > and z, w G intRf . Thus 

dH ln s; in ' Kx) )> n s; zoo* wv ) = dH{ ^ S] x),<t,{s; u ' 

We also have that d,H(x/l(x),y/l(y)) = dn{x,y). Therefore we may assume that x,y G S. Let 
e > and define G := {z G 5 : rfi/(z, 1) < i? + e}. It was shown in that G is convex. 
Also, Lemma ^ states that g is locally Lipschitzian. We may therefore apply Proposition 
with / := g. Since g is homogeneous of degree s, we have that g'(x){x) — sg[x) for all x G G. 
This, combined with the fact that ^(x)^^ = 0, implies that ||g'(x)||^ = ||p'(a;)||i/. Using 
Corollary|21 and letting e approach zero, we deduce the required result. □ 
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Lemma 8 Theorem^ holds in the special case when the linear span of {u,x,y} is 1- or 2- 
dimensional. 

Proof. If the linear span of {u,x,y} is one-dimensional, then all Hilbert metric distances are 
zero, so assume that it is two-dimensional. The same argument as was used in Lemma [S] shows 
that it suffices to prove the result for C = R+, u = 1, and x, y G intR^. As shown in the proof 
of Lemmad we may assume that l(x) = l(y) = 1 where I ((21,22)) := (zi + z^jl. We now apply 
Proposition 2 with / := g and G :— S := {z G intR+ : l(z) = 1}. Again, ||<7'(a;)||# = ||5'(a;)||/f 
for all x G G. The result follows from the first part of Corollary □ 

Proof, [of Theorem |2] The proof uses Lemmas and |H1 and is exactly analogous to the proof 
of Theorem □ □ 

Proof, [of Corollary^ We first prove the result for the case of Thompson's metric. We will 
use the alternative characterization of semihyperbolicity given in Lemma 1.2 of 1 . Suppose 
x, y, x 1 , y' G C are all in the same part and are such that neither drix, x') nor dr(y, y') is greater 
than 1. Let t G [0, 00) and write z := C(x,y){t) and w := <fi(dr(x, z)/cIt(x, y); x,y'). Observe that 
dr{y,y') < 1 implies \dr{x,y) — dsr(x,y')\ < 1. Since dr(x,w) = dr(x, y r )dr(x, z)/dr(x, y), we 
have 

\dr(x, w) — dr(x, z)\ < dr(x, z)/dr{x, y) < 1 
Similar reasoning allows us to conclude that 

\dr(x, w') — dr(x' , z')\ < 1, 

where z' := Gx',y')(t) an d w' := 4>(dr(x', z')/dr(x', y'); x, y'). From dr(%, z) — min(i, dr(x, y)) 
and dr(x\ z') = min(i, ', y 1 )), we have that 

\d T (x,z)-d T (x',z')\ < \d T (x,y) -d T (x',y')\ < 2. 

So 

dr(w, w ) — \dr(x, w) — dr(x, w')\ < 4. 

By Theorem HI d T (z,w) < 2d T (y,y') < 2 and d T (z',w') < 2d T (x,x') < 2. The triangle 
inequality gives dr(z,z') < dr(z,w) + dr(w,w') + dr(w',z') < 8. This is the uniform bound 
required by the characterization of semihyperbolicity we are using. 

The proof that C is semihyperbolic when endowed with Hilbert's metric is similar. □ 
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